
A Loop-Based Methodology for Reducing Computational Redundancy in Workload Sets



Abstract

The design of general-purpose processors relies heavily on a workload gathering step in which representative programs are collected from various application domains. Processor performance, when running the workload set, is profiled using simulators that model the targeted processor architecture. However, simulating the entire workload set is prohibitively time-consuming, which precludes considering a large number of programs. To reduce simulation time, several techniques in the literature have exploited internal program repetitiveness to extract and execute only representative code segments. Existing solutions are based on reducing cross-program computational redundancy or on eliminating internal-program redundancy to decrease execution time. In this work, we propose an orthogonal and complementary loop-centric methodology that targets loop-dominant programs by exploiting internal-program characteristics to reduce cross-program computational redundancy. The approach employs a newly developed framework that extracts and analyzes core loops within workloads. The collected characteristics model memory behavior, computational complexity, and data structures of a program, and are used to construct a signature vector for each program. From these vectors, cross-workload similarity metrics are extracted, which are processed by a novel heuristic to exclude similar programs and reduce redundancy within the set. Finally, a reverse engineering approach that synthesizes executable micro-benchmarks having the same instruction mix as the loops in the original workload is introduced. A tool that automates the flow steps of the proposed methodology is developed. Simulation results demonstrate that applying the proposed methodology to a set of workloads reduces the set size by half, while preserving the main characterizations of the initial workloads.
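
The abstract does not specify the similarity metric or the pruning heuristic, so the following is only an illustrative sketch of the general idea of comparing per-program signature vectors and dropping near-duplicates. Everything in it is an assumption made for illustration: cosine similarity stands in for the paper's metrics, a greedy threshold pass stands in for its heuristic, and the function names, the threshold of 0.95, and the three-component vectors are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length numeric vectors.

    Assumption: cosine similarity is a stand-in; the abstract does not
    say which cross-workload similarity metrics are actually used.
    """
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def prune_similar(signatures, threshold=0.95):
    """Greedily keep a program only if it is not too similar to any kept one.

    Assumption: this greedy pass and the threshold value are hypothetical
    and are not the heuristic proposed in the paper.
    """
    kept = []
    for name, vec in signatures.items():
        if all(cosine_similarity(vec, signatures[k]) < threshold for k in kept):
            kept.append(name)
    return kept


# Hypothetical signature vectors; the components loosely stand for
# memory behavior, computational complexity, and data-structure features.
signatures = {
    "prog_a": [0.90, 0.10, 0.30],
    "prog_b": [0.88, 0.12, 0.31],
    "prog_c": [0.10, 0.90, 0.20],
    "prog_d": [0.12, 0.85, 0.25],
}

representatives = prune_similar(signatures)
print(representatives)
```

On this toy input the pass keeps one program from each pair of near-identical vectors. The outcome of a greedy pass like this depends on iteration order and on the chosen threshold, which is one reason a real heuristic would likely need to be more careful.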
